Suggested websites

ABSTRACT

An apparatus and a method for suggesting a web site is described. In one embodiment, a user at a client enters a web address. A suggestion server receives the web address and generates one or more suggested web addresses for the user based on an analysis of captured aggregate online users behavior in relation to the web address. The suggestion server stores and organizes results of the analysis of the captured aggregate online users behavior. The results includes a list of two or more substantially related and relevant web addresses. The suggestion server compares the web address with the results of the analysis to generate the one or more suggested web addresses. The one or more suggested web addresses are sent to the client.

TECHNICAL FIELD

Embodiments of the present invention relate to computing systems, andmore particularly, to a method and apparatus for suggesting websites.

BACKGROUND

With the increasing popularity of the Internet and the World Wide Web,it is common for on-line users to utilize search engines to search theInternet for desired information. In order to perform a search, a usersubmits a query containing one or more query terms. A query serverprogram of the search engine processes the query to identify any itemsthat match the terms of the query The set of items identified by thequery server program is referred to as a “query result.” In web-basedimplementations, the query result is typically presented to the user asa hypertextual listing of the located items.

However such queries may not be actual representation of what the usermeant to search. As such, the process of parsing from one web site toanother web site consumes significant time to the user, and increasesthe amount of time the user must spend before reaching a web siterelevant to his/her query. A need therefore exists for a method forsuggesting websites based on collective user behavior and not the actualquery entries.

BRIEF DESCRIPTION OF THE DRAWINGS

The present invention is illustrated by way of example, and not by wayof limitation, in the figures of the accompanying drawings and in which:

FIG. 1 is a block diagram illustrating one embodiment of a system foranalyzing a user online behavior.

FIG. 2 is a block diagram illustrating one embodiment of a system forsuggesting a website.

FIG. 3 is a block diagram illustrating another embodiment of a systemfor suggesting a website.

FIG. 4 is a flow diagram illustrating one embodiment of a method foranalyzing and storing a user online behavior.

FIG. 5 is a flow diagram illustrating one embodiment of a method forgenerating a list of suggested websites to a client using the collecteddata of the client online behavior of FIG. 4.

FIG. 6 is a block diagram illustrating an example of a computer system.

DETAILED DESCRIPTION

Described herein is a method and apparatus for suggesting a web site. Inone embodiment, a user at a client enters a web address. A suggestionserver receives the web address and generates one or more suggested webaddresses for the user based on an analysis of captured aggregate onlineusers behavior in relation to the web address. The suggestion serverstores and organizes results of the analysis of the captured aggregateonline users behavior. The results includes a list of two or moresubstantially related and relevant web addresses. The suggestion servercompares the web address with the results of the analysis to generate anumber of suggested web addresses. The suggested web addresses are sentto the client. In other words, a service is provided where a user canenter a web address, and the service returns a list of pages “like” thatone, based on the criterion that others who have visited that particularweb address have also visited the other sites listed, sorted based onrecency of access and proportion of visitors to the first site whovisited both sites. It is possible that no results are being sent to theclient. In that scenario, there may not have been any matches found.

FIG. 1 is a block diagram illustrating one embodiment of a system foranalyzing a user online behavior. A client 102 includes a web browser103. A user enters a web address in the web browser 103 to access a webpage P1 112 from a web server 106 via a network such as the Internet104. The user at client 102 may then visit another web page P2 114 fromanother web server 108 within a time t₁₋₂ another example, the user atclient 102 may visit another web page P3 116 from another web server 110within a time t₁₋₃. For illustration purposes, each web page is storedwith a corresponding web server. However, those of ordinary skills inthe art will recognize that the different web pages 112, 114, 116 mayalso be stored on a single web server or one or more web servers.

Recency of access is measured using t₁₋₂ and t₁₋₃. How close togetherthe page visits are both temporally and behaviorally. If the user visitsweb page P1 112 and then immediately visits web page P2 114, the visitto web page P2 114 is behaviorally adjacent to the visit to web page P1(or, web page P1 and web page P2 are behaviorally adjacent pages forthat user). In one embodiment, this adjacency relationship index iscalibrated on a continuous −1 to 1 scale, where −1 is not adjacent atall, and 1 is entirely adjacent. Furthermore, the adjacency relationshipindex can be adjusted by time between events For example, web page P1112 and web page P2 114 are more strongly adjacent if the user's act ofclicking on web page P1 112 and web page P2 114 happened closer togetherin time (i.e. t₁₋₂ is less than t₁₋₃).

In one embodiment, the aggregate statistics are collected and used togenerate a ranking of relevant web pages. For example, if 80% of usershave web page P1 112 and web page P2 114 behaviorally adjacent, and 25%of users have web page P1 112 and web page P3 116 behaviorally adjacent,web page P2 114 would rank higher than web page P3 116 on a query forweb page P1 112.

FIG. 2 is a block diagram illustrating one embodiment of a system forsuggesting a relevant website. A client 202 includes a web browser 203.Client 202 communicates with suggestion server 206. In one embodiment,suggestion server 206 receives a web address of a web page 214 of a webserver 212 from web browser 203 via a network such as the Internet 204.

In one embodiment, suggestion server 206 includes a suggestion engine208 coupled to a user behavior database 210. The suggestion engine 208generates a list of one or more relevant web addresses based on ananalysis of captured aggregate online users behavior stored in the userbehavior database 210 in relation to the web address provided by client202. Suggestion engine sends the list to client 202. Browser 203 can acton the list by visiting a suggested web page 218 from web server 216.

In one embodiment, user behavior database 210 stores and organizesresults of the analysis of the captured aggregate online users behavior.Suggestion engine 208 compares the provided web address from client 202with the results of the analysis in behavior database 210 to generatesuggested web addresses.

FIG. 3 is a block diagram illustrating another embodiment of a systemfor suggesting a website. A third party server such as a data collectionserver 302 collects aggregate online users behavior and stores the datain a behavior database 304. Suggestion engine 208 communicates with datacollection server 302 to access the behavior database 304 to analyze theweb address and generate a list of suggested relevant web addresses.

In another embodiment, the data stored in behavior database 304 of datacollection server 302 may be collected using a plug-in 306 installed onthe browser 203 of client 202. The browser plugin 306 can be installedon several computers connected to the Internet 204 to gather a largesample data to generate web addresses that are relevant to each other.The sample data may include temporal and behavioral data of aggregateonline users. For example, the behavioral data includes a behavioraladjacent index for two or more web addresses of web pages visited by auser. A first web page is behaviorally adjacent to a second web pagewhen the user visits the first web page and visits the second web pagewithin a predetermined amount of time. For the temporal data, a firstweb page is behaviorally adjacent to a second web page when the uservisits the first web page and immediately visits the second web page.The temporal data increases the behavioral adjacent index when the uservisits the first web and immediately visits the second web page.

FIG. 4 is a flow diagram illustrating one embodiment of a method foranalyzing and storing a user online behavior. At 402, online userbehavior is collected, analyzed and stored. In particular, behavioraland temporal data are analyzed. At 406, relevant and related web sitesare generated and ranked based on the online user behavior. The websitesare stored and organized in a behavior database. The ranking is affectedby an adjacency behavioral index as previously described. The adjacencybehavioral index is further affected by a temporal relationship index aspreviously described.

FIG. 5 is a flow diagram illustrating one embodiment of a method forgenerating a list of suggested websites to a client using the collecteddata of the client online behavior of FIG. 4. At 510, a suggestionserver receives a web address input from a client. At 512, thesuggestion server generates a list of suggested and relevant web sitesbased on the ranking and behavior analysis performed with respect toFIG. 4. At 514, suggestion server provides a list of suggested web sitesto the client.

FIG. 6 illustrates a diagrammatic representation of a machine in theexemplary form of a computer system 600 within which a set ofinstructions, for causing the machine to perform any one or more of themethodologies discussed herein, may be executed. In alternativeembodiments, the machine may be connected (e.g., networked) to othermachines in a LAN, an intranet, an extranet, or the Internet. Themachine may operate in the capacity of a server or a client machine inclient-server network environment, or as a peer machine in apeer-to-peer (or distributed) network environment. The machine may be apersonal computer (PC), a tablet PC, a set-top box (STB), a PersonalDigital Assistant (PDA), a cellular telephone, a web appliance, aserver, a network router, switch or bridge, or any machine capable ofexecuting a set of instructions (sequential or otherwise) that specifyactions to be taken by that machine. Further, while only a singlemachine is illustrated, the term “machine” shall also be taken toinclude any collection of machines that individually or jointly executea set (or multiple sets) of instructions to perform any one or more ofthe methodologies discussed herein.

The exemplary computer system 600 includes a processing device 602, amain memory 604 (e.g., read-only memory (ROM), flash memory, dynamicrandom access memory (DRAM) such as synchronous DRAM (SDRAM), a staticmemory 606 (e.g., flash memory, static random access memory (SRAM),etc.), and a data storage device 618, which communicate with each othervia a bus 630.

Processing device 602 represents one or more general-purpose processingdevices such as a microprocessor, central processing unit, or the like.More particularly, the processing device may be complex instruction setcomputing (CISC) microprocessor, reduced instruction set computing(RISC) microprocessor, very long instruction word (VLIW) microprocessor,or processor implementing other instruction sets, or processorsimplementing a combination of instruction sets. Processing device 602may also be one or more special-purpose processing devices such as anapplication specific integrated circuit (ASIC), a field programmablegate array (FPGA), a digital signal processor (DSP), network processor,or the like. The processing device 602 is configured to execute theprocessing logic 626 for performing the operations and steps discussedherein.

The computer system 600 may further include a network interface device608. The computer system 600 also may include a video display unit 610(e.g., a liquid crystal display (LCD) or a cathode ray tube (CRT)), analphanumeric input device 612 (e.g., a keyboard), a cursor controldevice 614 (e.g., a mouse), and a signal generation device 616 (e.g., aspeaker).

The data storage device 618 may include a computer-accessible storagemedium 630 on which is stored one or more sets of instructions (e.g.,software 622) embodying any one or more of the methodologies orfunctions described herein. The software 622 may also reside, completelyor at least partially, within the main memory 604 and/or within theprocessing device 602 during execution thereof by the computer system600, the main memory 604 and the processing device 602 also constitutingmachine-accessible storage media. The software 622 may further betransmitted or received over a network 620 via the network interfacedevice 608.

The computer-accessible storage medium 630 may also be used to store thesuggestion engine and behavior database 624 as presently described. Thesuggestion engine and behavior database 624 may also be stored in othersections of computer system 600, such as static memory 606.

While the computer-accessible storage medium 630 is shown in anexemplary embodiment to be a single medium, the term“computer-accessible storage medium” should be taken to include a singlemedium or multiple media (e.g., a centralized or distributed database,and/or associated caches and servers) that store the one or more sets ofinstructions. The term “computer-accessible storage medium” shall alsobe taken to include any medium that is capable of storing, encoding orcarrying a set of instructions for execution by the computer system andthat cause the computer system to perform any one or more of themethodologies of the present invention. The term “computer-accessiblestorage medium” shall accordingly be taken to include, but not belimited to, solid-state memories, optical and magnetic media.

Some portions of the detailed descriptions above are presented in termsof algorithms and symbolic representations of operations on data bitswithin a computer memory. These algorithmic descriptions andrepresentations are the means used by those skilled in the dataprocessing arts to most effectively convey the substance of their workto others skilled in the art. An algorithm is here, and generally,conceived to be a self-consistent sequence of steps leading to a desiredresult. The steps are those requiring physical manipulations of physicalquantities. Usually, though not necessarily, these quantities take theform of electrical or magnetic signals capable of being stored,transferred, combined, compared, and otherwise manipulated. It hasproven convenient at times, principally for reasons of common usage, torefer to these signals as bits, values, elements, symbols, characters,terms, numbers, or the like.

It should be borne in mind, however, that all of these and similar termsare to be associated with the appropriate physical quantities and aremerely convenient labels applied to these quantities. Unlessspecifically stated otherwise as apparent from the following discussion,it is appreciated that throughout the description, discussions utilizingterms such as “processing” or “computing” or “calculating” or“determining” or “displaying” or the like, refer to the action andprocesses of a computer system, or similar electronic computing device,that manipulates and transforms data represented as physical(electronic) quantities within the computer system's registers andmemories into other data similarly represented as physical quantitieswithin the computer system memories or registers or other suchinformation storage, transmission or display devices.

The present invention also relates to apparatus for performing theoperations herein. This apparatus may be specially constructed for therequired purposes, or it may comprise a general purpose computerselectively activated or reconfigured by a computer program stored inthe computer. Such a computer program may be stored in a computerreadable storage medium, such as, but is not limited to, any type ofdisk including floppy disks, optical disks, CD-ROMs, andmagnetic-optical disks, read-only memories (ROMs), random accessmemories (RAMs), EPROMs, EEPROMs, magnetic or optical cards, or any typeof media suitable for storing electronic instructions, and each coupledto a computer system bus.

The algorithms and displays presented herein are not inherently relatedto any particular computer or other apparatus. Various general purposesystems may be used with programs in accordance with the teachingsherein, or it may prove convenient to construct more specializedapparatus to perform the required method steps. The required structurefor a variety of these systems will appear from the description below.In addition, the present invention is not described with reference toany particular programming language. It will be appreciated that avariety of programming languages may be used to implement the teachingsof the invention as described herein.

It is to be understood that the above description is intended to beillustrative, and not restrictive. Many other embodiments will beapparent to those of skill in the art upon reading and understanding theabove description. The scope of the invention should, therefore, bedetermined with reference to the appended claims, along with the fullscope of equivalents to which such claims are entitled.

1. A computer-implemented method comprising: receiving a web addressfrom a client; generating one or more suggested web addresses to theclient based on an analysis of captured aggregate online users behaviorin relation to the web address; and sending the one or more suggestedweb addresses to the client.
 2. The computer-implemented method of claim1 further comprising; storing and organizing results of the analysis ofthe captured aggregate online users behavior in a storage device, theresults including a list of two or more substantially related andrelevant web addresses; and comparing the web address with the resultsof the analysis in the storage device to generate the one or moresuggested web addresses.
 3. The computer-implemented method of claim 1further comprising: communicating with a third party server that storesand organizes results of the analysis of the captured aggregate onlineusers behavior in a storage device, the results including a list of twoor more substantially related and relevant web addresses, wherein thethird party server compares the web address received from the clientwith the results of the analysis in the storage device to generate theone or more suggested web addresses.
 4. The computer-implemented methodof claim 2 further comprising: providing a plug-in application to webbrowsers of one or more users to capture the aggregate online usersbehavior, wherein the plug-in application captures and sends datarelated to temporal and behavioral of aggregate online users.
 5. Thecomputer-implemented method of claim 1 wherein the captured aggregateonline users behavior includes behavioral data and temporal data.
 6. Thecomputer-implemented method of claim 5 wherein the behavioral dataincludes a behavioral adjacent index for two or more web addresses ofweb pages visited by a user, a first web page behaviorally adjacent to asecond web page when the user visits the first web page and visits thesecond web page within a predetermined amount of time.
 7. Thecomputer-implemented method of claim 5 wherein the temporal dataincludes two or more web addresses of web pages visited by a user, afirst web page behaviorally adjacent to a second web page when the uservisits the first web page and immediately visits the second web page,the temporal data increasing the behavioral adjacent index when the uservisits the first web and immediately visits the second web page.
 8. Aserver comprising: a processing device; a memory coupled to theprocessing device; and a suggestion engine coupled to the processor, thesuggestion engine configured to receive a web address from a client, togenerate one or more suggested web addresses to the client based on ananalysis of captured aggregate online users behavior in relation to theweb address, and to send the one or more suggested web addresses to theclient.
 9. The server of claim 8 further comprising: a storage devicecoupled to the suggestion engine, the storage device configured to storeand organize results of the analysis of the captured aggregate onlineusers behavior, the results including a list of two or moresubstantially related and relevant web addresses, wherein the suggestionengine is configured to compare the web address with the results of theanalysis in the storage device to generate the one or more suggested webaddresses.
 10. The server of claim 8 further comprising: a communicationinterface configured to communicate with a third party server thatstores and organizes results of the analysis of the captured aggregateonline users behavior in a storage device, the results including a listof two or more substantially related and relevant web addresses, whereinthe third party server compares the web address received from the clientwith the results of the analysis in the storage device to generate theone or more suggested web addresses.
 11. The server of claim 9 whereinthe suggestion engine provides a plug-in module to web browsers of oneor more users to capture the aggregate online users behavior, whereinthe plug in application captures and sends data related to temporal andbehavioral of aggregate online users.
 12. The server of claim 8 whereinthe captured aggregate online users behavior includes behavioral dataand temporal data.
 13. The server of claim 12 wherein the behavioraldata includes a behavioral adjacent index for two or more web addressesof web pages visited by a user, a first web page behaviorally adjacentto a second web page when the user visits the first web page and visitsthe second web page within a predetermined amount of time.
 14. Theserver of claim 12 wherein the temporal data includes two or more webaddresses of web pages visited by a user, a first web page behaviorallyadjacent to a second web page when the user visits the first web pageand immediately visits the second web page, the temporal data increasingthe behavioral adjacent index when the user visits the first web andimmediately visits the second web page.
 15. A computer-accessiblestorage medium including data that, when accessed by a computer system,cause the computer system to perform a method comprising: receiving aweb address from a client; generating one or more suggested webaddresses to the client based on an analysis of captured aggregateonline users behavior in relation to the web address; and sending theone or more suggested web addresses to the client.
 16. Thecomputer-accessible storage medium of claim 15 wherein the methodfurther comprises: storing and organizing results of the analysis of thecaptured aggregate online users behavior in a storage device, theresults including a list of two or more substantially related andrelevant web addresses; and comparing the web address with the resultsof the analysis in the storage device to generate the one or moresuggested web addresses.
 17. The computer-accessible storage medium ofclaim 15 wherein the method further comprises: communicating with athird party server that stores and organizes results of the analysis ofthe captured aggregate online users behavior in a storage device, theresults including a list of two or more substantially related andrelevant web addresses, wherein the third party server compares the webaddress received from the client with the results of the analysis in thestorage device to generate the one or more suggested web addresses. 18.The computer-accessible storage medium of claim 16 wherein the methodfurther comprises: providing a plug-in application to web browsers ofone or more users to capture the aggregate online users behavior,wherein the plug-in application captures and sends data related totemporal and behavioral of aggregate online users.
 19. Thecomputer-accessible storage medium of claim 15 wherein the capturedaggregate online users behavior includes a behavioral adjacent index fortwo or more web addresses of web pages visited by a user, a first webpage behaviorally adjacent to a second web page when the user visits thefirst web page and visits the second web page within a predeterminedamount of time.
 20. The computer-accessible storage medium of claim 19wherein the captured aggregate online users behavior includes a temporaldata that increases the behavioral adjacent index when the user visitsthe first web and immediately visits the second web page.